ABSTRACT
Accurate in silico prediction of conformational B-cell epitopes would lead to major improvements in disease diagnostics, drug design and vaccine development. A variety of computational methods, mainly based on machine learning approaches, have been developed in the last decades to tackle this challenging problem. Here, we rigorously benchmarked nine state-of-the-art conformational B-cell epitope prediction webservers, including generic and antibody-specific methods, on a dataset of over 250 antibody-antigen structures. The results of our assessment and statistical analyses show that all the methods achieve very low performances, and some do not perform better than randomly generated patches of surface residues. In addition, we also found that commonly used consensus strategies that combine the results from multiple webservers are at best only marginally better than random. Finally, we applied all the predictors to the SARS-CoV-2 spike protein as an independent case study, and showed that they perform poorly in general, which largely recapitulates our benchmarking conclusions. We hope that these results will lead to greater caution when using these tools until the biases and issues that limit current methods have been addressed, promote the use of state-of-the-art evaluation methodologies in future publications and suggest new strategies to improve the performance of conformational B-cell epitope prediction methods.
Subject(s)
Epitopes, B-Lymphocyte , Spike Glycoprotein, Coronavirus , Humans , Computational Biology/methods , Epitopes, B-Lymphocyte/immunology , SARS-CoV-2 , Spike Glycoprotein, Coronavirus/immunologyABSTRACT
Antibodies recognize their cognate antigens with high affinity and specificity, but the prediction of binding sites on the antigen (epitope) corresponding to a specific antibody remains a challenging problem. To address this problem, we developed AbAdapt, a pipeline that integrates antibody and antigen structural modeling with rigid docking in order to derive antibody-antigen specific features for epitope prediction. In this study, we systematically assessed the impact of integrating the state-of-the-art protein modeling method AlphaFold with the AbAdapt pipeline. By incorporating more accurate antibody models, we observed improvement in docking, paratope prediction, and prediction of antibody-specific epitopes. We further applied AbAdapt-AF in an anti-receptor binding domain (RBD) antibody complex benchmark and found AbAdapt-AF outperformed three alternative docking methods. Also, AbAdapt-AF demonstrated higher epitope prediction accuracy than other tested epitope prediction tools in the anti-RBD antibody complex benchmark. We anticipate that AbAdapt-AF will facilitate prediction of antigen-antibody interactions in a wide range of applications.